Molecular Descriptors Property Prediction Using Transformer-Based Approach
نویسندگان
چکیده
In this study, we introduce semi-supervised machine learning models designed to predict molecular properties. Our model employs a two-stage approach, involving pre-training and fine-tuning. Particularly, our leverages substantial amount of labeled unlabeled data consisting SMILES strings, text representation system for molecules. During the stage, capitalizes on Masked Language Model, which is widely used in natural language processing, chemical space representations. fine-tuning trained smaller dataset tackle specific downstream tasks, such as classification or regression. Preliminary results indicate that demonstrates comparable performance state-of-the-art chosen tasks from MoleculeNet. Additionally, reduce computational overhead, propose new approach taking advantage 3D compound structures calculating attention score end-to-end transformer anti-malaria drug candidates. The show using proposed score, able have with pre-trained models.
منابع مشابه
Systems Biological Approach of Molecular Descriptors Connectivity: Optimal Descriptors for Oral Bioavailability Prediction
BACKGROUND Poor oral bioavailability is an important parameter accounting for the failure of the drug candidates. Approximately, 50% of developing drugs fail because of unfavorable oral bioavailability. In silico prediction of oral bioavailability (%F) based on physiochemical properties are highly needed. Although many computational models have been developed to predict oral bioavailability, th...
متن کاملTransformer protection using MLE approach
This paper proposed a new wavelet based method to identify inrush currents and to distinguish it from power system faults. The proposed algorithm extracts fault and inrush generated transient signals using DWT. Transient current signals at both sides of a transformer are firstly captured. The wavelet transform is a relatively new and powerful tool in the analysis of power transformer transient ...
متن کاملIn-silico prediction of Cellular Responses to Polymeric Biomaterials from Their Molecular Descriptors
In this work quantitative structure activity relationship (QSAR) methodology was applied for modeling and prediction of cellular response to polymers that have been designed for tissue engineering. After calculation and screening of molecular descriptors, linear and nonlinear models were developed by using multiple linear regressions (MLR) and artificial neural network (ANN) methods. The root m...
متن کاملPrediction and Dissection of Protein-RNA Interactions by Molecular Descriptors.
Protein-RNA interactions play crucial roles in numerous biological processes. However, detecting the interactions and binding sites between protein and RNA by traditional experiments is still time consuming and labor costing. Thus, it is of importance to develop bioinformatics methods for predicting protein-RNA interactions and binding sites. Accurate prediction of protein-RNA interactions and ...
متن کاملPrediction of pesticides chromatographic lipophilicity from the computational molecular descriptors.
Quantitative structure-property relationship models were developed for the prediction of pesticides and some PAH compounds lipophilicity based on a wide set of computational molecular descriptors and a set of experimental chromatographic data. The chromatographic lipophilicity of pesticides has been evaluated by high-performance liquid chromatography (HPLC) using different chemically bonded (C1...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Molecular Sciences
سال: 2023
ISSN: ['1661-6596', '1422-0067']
DOI: https://doi.org/10.3390/ijms241511948